Error Dynamics Based Dual Heuristic Dynamic Programming for Self-Learning Flight Control

نویسندگان

چکیده

A data-driven nonlinear control approach, called error dynamics-based dual heuristic dynamic programming (ED-DHP), is proposed for air vehicle attitude control. To solve the optimal tracking problem, augmented system defined by derived dynamics and reference trajectory so that actor neural network can learn feedforward feedback terms at same time. During online self-learning process, learns policy minimizing system’s value function. The input identified recursive least square (RLS) output of critic are used to update network. In addition, total uncertainty term also RLS, which compensate caused inaccurate modeling, parameter perturbation, on. outputs ED-DHP include rough trim surface, from network, compensation. Based on this scheme, complete knowledge not needed, offline learning unnecessary. verify ability ED-DHP, two numerical experiments carried out based established morphing model. One sinusoidal signal a fixed operating point, other guidance command with process variable points. simulation results demonstrate good performance validate robustness scheme

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Dual Heuristic Programming for Fuzzy Control

Overview material for the Special Session (Tuning Fuzzy Controllers Using Adaptive Critic Based Approximate Dynamic Programming) is provided. The Dual Heuristic Programming (DHP) method of Approximate Dynamic Programming is described and used to the design a fuzzy control system. DHP and related techniques have been developed in the neurocontrol context but can be equally productive when used w...

متن کامل

Reinforcement Control via Heuristic Dynamic Programming

Heuristic Dynamic Programming (HDP) is the simplest kind of Adaptive Critic which is a powerful form of reinforcement control 1]. It can be used to maximize or minimize any utility function, such as total energy or trajectory error, of a system over time in a noisy environment. Unlike supervised learning, adaptive critic design does not require the desired control signals be known. Instead, fee...

متن کامل

Electromagnetic Formation Flight Control Using Dynamic Programming

Electromagnetic formation flight (EMFF) is an enabling technology for a number of spacecraft mission architectures. The RINGS program will be the first time EMFF is demonstrated in a microgravity environment. Nonlinearities due to magnetic field interactions preclude linear feedback controllers from being used to control the RINGS system. Approximate dynamic programming is explored in this pape...

متن کامل

Extracting Dynamics Matrix of Alignment Process for a Gimbaled Inertial Navigation System Using Heuristic Dynamic Programming Method

In this paper, with the aim of estimating internal dynamics matrix of a gimbaled Inertial Navigation system (as a discrete Linear system), the discretetime Hamilton-Jacobi-Bellman (HJB) equation for optimal control has been extracted. Heuristic Dynamic Programming algorithm (HDP) for solving equation has been presented and then a neural network approximation for cost function and control input ...

متن کامل

Natural Heuristic Dynamic Programming for Dynamic Systems

Heuristic Dynamic Programming (HDP) is the simplest kind of Adaptive Critic 1]. It can be used to maximize or minimize any utility function, such as total energy or trajectory error, of a system over time in a noisy environment. In this article, we propose a new version of HDP, called NHDP (Natural Heuristic Dynamic Programming). This new version incorporates basic HDP algorithm with the follow...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Applied sciences

سال: 2022

ISSN: ['2076-3417']

DOI: https://doi.org/10.3390/app13010586